HEMIN: A Cryptographic Approach for Private k-NN Classification
نویسندگان
چکیده
Data mining is frequently obstructed by privacy concerns. In many cases, data is shared with third party for the mining purpose. However, sharing of data for analysis is not possible due to prevailing privacy laws and/or policies. Privacy preserving data mining techniques have been developed to address this issue by providing mechanisms to mine the data while guaranteing certain levels of privacy. In this paper, we address the issue of privacy preserving nearest neighbor search, which forms the basis of many data mining applications. We propose a scheme HEMIN for sharing the data to be mined by way of homomorphic encryption scheme at the data owner end. The nearest neighbors of a given data point is computed while the data point as well as the data points in training set are in encrypted form. This is being done using the third party which is assumed to be semi honest party. The efficiency of the model has been shown over three datasets namely, Iris, German credit database and Australian credit dataset. The misclassification ratio has been computed to show the efficiency of the k-NN classifier. The results show the effectiveness of the proposed method and suggests its applicability for various parametric data mining algorithms.
منابع مشابه
A hybridization of evolutionary fuzzy systems and ant Colony optimization for intrusion detection
A hybrid approach for intrusion detection in computer networks is presented in this paper. The proposed approach combines an evolutionary-based fuzzy system with an Ant Colony Optimization procedure to generate high-quality fuzzy-classification rules. We applied our hybrid learning approach to network security and validated it using the DARPA KDD-Cup99 benchmark data set. The results indicate t...
متن کاملA comparative study of performance of K-nearest neighbors and support vector machines for classification of groundwater
The aim of this work is to examine the feasibilities of the support vector machines (SVMs) and K-nearest neighbor (K-NN) classifier methods for the classification of an aquifer in the Khuzestan Province, Iran. For this purpose, 17 groundwater quality variables including EC, TDS, turbidity, pH, total hardness, Ca, Mg, total alkalinity, sulfate, nitrate, nitrite, fluoride, phosphate, Fe, Mn, Cu, ...
متن کاملThe Privacy of k-NN Retrieval for Horizontal Partitioned Data -- New Methods and Applications
Recently, privacy issues have become important in clustering analysis, especially when data is horizontally partitioned over several parties. Associative queries are the core retrieval operation for many data mining algorithms, especially clustering and k-NN classification. The algorithms that efficiently support k-NN queries are of special interest. We show how to adapt well-known data structu...
متن کاملOptimized Seizure Detection Algorithm: A Fast Approach for Onset of Epileptic in EEG Signals Using GT Discriminant Analysis and K-NN Classifier
Background: Epilepsy is a severe disorder of the central nervous system that predisposes the person to recurrent seizures. Fifty million people worldwide suffer from epilepsy; after Alzheimer’s and stroke, it is the third widespread nervous disorder.Objective: In this paper, an algorithm to detect the onset of epileptic seizures based on the analysis of brain electrical signals (EEG) has b...
متن کاملComparison between Pixel-based and Object-based Classifications Using Radar Satellite Image in Extracting Massive Flood Extent at Northern Region of Peninsular Malaysia
Year 2010 massive flood hit the northern region of Peninsular Malaysia particularly Perlis and Kedah involved several districts and destroyed many agricultural areas and the infrastructure. This study focuses on the comparison between pixel-based classification and object-based classification of five machine learning algorithms including Parallelepiped (PP), Minimum Distance (MD), Maximum Likel...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008